167 results found.
Written
OCR quality assessment of large historical newspaper corpus,
Language Type:
Multilingual
Languages:
French German Luxembourgish
Availability:
Freely Available
License:
CC BY 4.0
Size:
None Production Status:
Use:
multiple uses
-
Paper title:Language Resources for Historical Newspapers: the Impresso Collection
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Maud Ehrmann | Impresso OCR Quality Assessment | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English French German
Availability:
Freely Available
License:
CC BY SA 4.0
Size:
None Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Language Resources for Historical Newspapers: the Impresso Collection
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Maud Ehrmann | Impresso HIPE Shared Task Named Entity Gold Standard | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Multilingual
Languages:
French German Luxembourgish
Availability:
Freely Available
License:
CC BY 4.0
Size:
None Production Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:Language Resources for Historical Newspapers: the Impresso Collection
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Maud Ehrmann | Impresso Word Embeddings | /N |
Documentation:
None
Written
Topic Modelling Data as extracted from historical newspapers,
Language Type:
Bilingual
Languages:
French German
Availability:
Freely Available
License:
CC BY 4.0
Size:
None Production Status:
Newly created-finished
Use:
multiple uses
-
Paper title:Language Resources for Historical Newspapers: the Impresso Collection
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Maud Ehrmann | Impresso Topic Modelling Data | /N |
Documentation:
None
Written
Text Reuse Data as extracted from historical newspapers,
Language Type:
Multilingual
Languages:
French German Luxembourgish
Availability:
Freely Available
License:
CC BY SA 4.0
Size:
None Production Status:
Newly created-finished
Use:
multiple uses
-
Paper title:Language Resources for Historical Newspapers: the Impresso Collection
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Maud Ehrmann | Impresso Text Reuse Data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
From Owner
License:
INA dataset GCU
Size:
5803 entries Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
-
Paper title:French Tweet Corpus for Automatic Stance Detection
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marc Evrard | TweetStanceFr2019 | /N |
Documentation:
https://github.com/ina-foss/tweet-stance-annotation
Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
Not Available
License:
MIT
Size:
350931 tokens Production Status:
Existing-updated
Use:
Named Entity Recognition
-
Paper title:Establishing a New State-of-the-Art for French Named Entity Recognition
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yoann Dupont | FTB-NE | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
Freely available in the future
License:
Size:
2020 sentences Production Status:
Newly created-finished
Use:
Textual Entailment and Paraphrasing
-
Paper title:A French Corpus for Semantic Similarity
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Rémi Cardon | A French Corpus for Semantic Similarity | /N |
Documentation:
None
Written
Terminology,
Language Type:
Multilingual
Languages:
English French Italian
Availability:
Freely Available
License:
Size:
1188 entries Production Status:
Newly created-in progress
Use:
Translation
-
Paper title:On the Formal Standardization of Terminology Resources: The Case Study of TriMED
-
Paper track:Terminology/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Federica Vezzani | TriMED | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English Finnish French German Russian Swedish
Availability:
Freely Available
License:
CC - BY - NC
Size:
2 GByte Production Status:
Existing-used
Use:
Textual Entailment and Paraphrasing
-
Paper title:Comparative Study of Sentence Embeddings for Contextual Paraphrasing
-
Paper track:Evaluation/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Louisa Pragst | Opusparcus | /N |
Documentation:
None




